METAe—Automated Encoding of Digitized Texts
Internal identifier: 000244 (Main/Exploration); previous: 000243; next: 000245
Authors: Birgit Stehno [Austria]; Alexander Egger [Austria]; Gregor Retti [Austria]
Source:
- Literary and Linguistic Computing [0268-1145]; 2003-04.
French descriptors
- Pascal (Inist): Automatisation, Bibliothèque électronique, Codage, Métadonnée, Normalisation, Norme, Numérisation.
- Wicri:
- topic: Automatisation, Codage, Normalisation, Norme, Numérisation.
English descriptors
- KwdEn: Automation, Coding, Digitizing, Electronic library, Metadata, Standardization, Standards.
Abstract
This paper explains why and how the digitization project METAe applies METS (Metadata Encoding and Transmission Standard) as encoding scheme for automatically extracted metadata. In contrast to TEI (Text Encoding Initiative) and other markup languages, METS allows encoding of the whole range of structural, descriptive, and administrative metadata in a systematic way. As the METS schema permits the integration of other existing standards, it provides a highly flexible output that can be converted easily to the individual needs of digital libraries. An innovative aspect of the METAe data structure is the ALTO file (‘Analysed layout and text object’), which contains the layout structures as well as the text passages of book pages. Structural maps of the METS schema are used to compose the logical and the physical structures out of ALTO and image files.
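As a rough illustration of the last sentence of the abstract, the sketch below builds a physical structural map whose page divisions point at both an image file and an ALTO file. It is not METAe's actual output: the `build_mets` function, the `IMAGE`/`ALTO` values of the `USE` attribute, the file ID pattern, and the input file names are assumptions made for this example. Only the METS element and attribute names (`fileSec`, `fileGrp`, `file`, `FLocat`, `structMap`, `div`, `fptr`) come from the METS schema itself. A logical map would presumably follow the same pattern with `TYPE="LOGICAL"` and divisions such as chapters.

```python
import xml.etree.ElementTree as ET

METS = "http://www.loc.gov/METS/"
XLINK = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS)
ET.register_namespace("xlink", XLINK)


def q(tag):
    """Return the namespace-qualified name of a METS element."""
    return f"{{{METS}}}{tag}"


def build_mets(pages):
    """Build a minimal METS tree.

    pages: list of (image_href, alto_href) tuples, one per book page
    (hypothetical input format chosen for this sketch).
    """
    mets = ET.Element(q("mets"))
    file_sec = ET.SubElement(mets, q("fileSec"))
    # One file group per kind of file; the USE values are assumptions.
    groups = {use: ET.SubElement(file_sec, q("fileGrp"), USE=use)
              for use in ("IMAGE", "ALTO")}
    phys = ET.SubElement(mets, q("structMap"), TYPE="PHYSICAL")
    book = ET.SubElement(phys, q("div"), TYPE="book")
    for order, (image_href, alto_href) in enumerate(pages, start=1):
        page = ET.SubElement(book, q("div"), TYPE="page", ORDER=str(order))
        for use, href in (("IMAGE", image_href), ("ALTO", alto_href)):
            file_id = f"{use}_{order:04d}"
            f = ET.SubElement(groups[use], q("file"), ID=file_id)
            ET.SubElement(f, q("FLocat"),
                          {"LOCTYPE": "URL", f"{{{XLINK}}}href": href})
            # The page division references the file by its ID.
            ET.SubElement(page, q("fptr"), FILEID=file_id)
    return mets


if __name__ == "__main__":
    tree = build_mets([("page1.tif", "page1.xml"), ("page2.tif", "page2.xml")])
    print(ET.tostring(tree, encoding="unicode"))
```

Keeping the file inventory (`fileSec`) separate from the structure (`structMap`) is what lets several maps reuse the same ALTO and image files.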
Url: https://api.istex.fr/document/C82C92A176F34CD3AE19FA346A45C9E863DBF21C/fulltext/pdf
DOI: 10.1093/llc/18.1.77
Affiliations: Austria (University of Innsbruck, Innsbruck; University of Graz, Graz)
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000057
- to stream Istex, to step Curation: 000057
- to stream Istex, to step Checkpoint: 000201
- to stream Main, to step Merge: 000265
- to stream PascalFrancis, to step Corpus: 000043
- to stream PascalFrancis, to step Curation: 000010
- to stream PascalFrancis, to step Checkpoint: 000035
- to stream Main, to step Merge: 000280
- to stream Main, to step Curation: 000244
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">METAe—Automated Encoding of Digitized Texts</title>
<author><name sortKey="Stehno, Birgit" sort="Stehno, Birgit" uniqKey="Stehno B" first="Birgit" last="Stehno">Birgit Stehno</name>
</author>
<author><name sortKey="Egger, Alexander" sort="Egger, Alexander" uniqKey="Egger A" first="Alexander" last="Egger">Alexander Egger</name>
</author>
<author><name sortKey="Retti, Gregor" sort="Retti, Gregor" uniqKey="Retti G" first="Gregor" last="Retti">Gregor Retti</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:C82C92A176F34CD3AE19FA346A45C9E863DBF21C</idno>
<date when="2003" year="2003">2003</date>
<idno type="doi">10.1093/llc/18.1.77</idno>
<idno type="url">https://api.istex.fr/document/C82C92A176F34CD3AE19FA346A45C9E863DBF21C/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000057</idno>
<idno type="wicri:Area/Istex/Curation">000057</idno>
<idno type="wicri:Area/Istex/Checkpoint">000201</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000201</idno>
<idno type="wicri:doubleKey">0268-1145:2003:Stehno B:metae:automated:encoding</idno>
<idno type="wicri:Area/Main/Merge">000265</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:04-0078653</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000043</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000010</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000035</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000035</idno>
<idno type="wicri:doubleKey">0268-1145:2003:Stehno B:metae:automated:encoding</idno>
<idno type="wicri:Area/Main/Merge">000280</idno>
<idno type="wicri:Area/Main/Curation">000244</idno>
<idno type="wicri:Area/Main/Exploration">000244</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">METAe—Automated Encoding of Digitized Texts</title>
<author><name sortKey="Stehno, Birgit" sort="Stehno, Birgit" uniqKey="Stehno B" first="Birgit" last="Stehno">Birgit Stehno</name>
<affiliation wicri:level="1"><country xml:lang="fr">Autriche</country>
<wicri:regionArea>University of Innsbruck, Innsbruck</wicri:regionArea>
<wicri:noRegion>Innsbruck</wicri:noRegion>
</affiliation>
<affiliation><wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Egger, Alexander" sort="Egger, Alexander" uniqKey="Egger A" first="Alexander" last="Egger">Alexander Egger</name>
<affiliation wicri:level="1"><country xml:lang="fr">Autriche</country>
<wicri:regionArea>University of Graz, Graz</wicri:regionArea>
<wicri:noRegion>Graz</wicri:noRegion>
</affiliation>
<affiliation><wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Retti, Gregor" sort="Retti, Gregor" uniqKey="Retti G" first="Gregor" last="Retti">Gregor Retti</name>
<affiliation wicri:level="1"><country xml:lang="fr">Autriche</country>
<wicri:regionArea>University of Innsbruck, Innsbruck</wicri:regionArea>
<wicri:noRegion>Innsbruck</wicri:noRegion>
</affiliation>
<affiliation><wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint><publisher>Oxford University Press</publisher>
<date type="published" when="2003-04">2003-04</date>
<biblScope unit="volume">18</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="77">77</biblScope>
<biblScope unit="page" to="88">88</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">C82C92A176F34CD3AE19FA346A45C9E863DBF21C</idno>
<idno type="DOI">10.1093/llc/18.1.77</idno>
<idno type="local">180077</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Automation</term>
<term>Coding</term>
<term>Digitizing</term>
<term>Electronic library</term>
<term>Metadata</term>
<term>Standardization</term>
<term>Standards</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Automatisation</term>
<term>Bibliothèque électronique</term>
<term>Codage</term>
<term>Métadonnée</term>
<term>Normalisation</term>
<term>Norme</term>
<term>Numérisation</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Automatisation</term>
<term>Codage</term>
<term>Normalisation</term>
<term>Norme</term>
<term>Numérisation</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This paper explains why and how the digitization project METAe applies METS (Metadata Encoding and Transmission Standard) as encoding scheme for automatically extracted metadata. In contrast to TEI (Text Encoding Initiative) and other markup languages, METS allows encoding of the whole range of structural, descriptive, and administrative metadata in a systematic way. As the METS schema permits the integration of other existing standards, it provides a highly flexible output that can be converted easily to the individual needs of digital libraries. An innovative aspect of the METAe data structure is the ALTO file (‘Analysed layout and text object’), which contains the layout structures as well as the text passages of book pages. Structural maps of the METS schema are used to compose the logical and the physical structures out of ALTO and image files.</div>
</front>
</TEI>
<affiliations><list><country><li>Autriche</li>
</country>
</list>
<tree><country name="Autriche"><noRegion><name sortKey="Stehno, Birgit" sort="Stehno, Birgit" uniqKey="Stehno B" first="Birgit" last="Stehno">Birgit Stehno</name>
</noRegion>
<name sortKey="Egger, Alexander" sort="Egger, Alexander" uniqKey="Egger A" first="Alexander" last="Egger">Alexander Egger</name>
<name sortKey="Retti, Gregor" sort="Retti, Gregor" uniqKey="Retti G" first="Gregor" last="Retti">Gregor Retti</name>
</country>
</tree>
</affiliations>
</record>
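The record above can also be read outside Dilib. The following is a minimal sketch, assuming Python's standard `xml.etree.ElementTree`. It works on a shortened excerpt of the record. Because the record uses the `wicri:` prefix without declaring it, a strict parser would reject it, so the sketch binds the prefix to a placeholder URI first. The `parse_record` function, the `WICRI_NS` value, and the returned dictionary keys are hypothetical names chosen for this example, not part of Dilib or Wicri.

```python
import xml.etree.ElementTree as ET

# Placeholder URI (an assumption): the record never declares the wicri: prefix.
WICRI_NS = "urn:example:wicri"

# Shortened excerpt of the record shown above.
SAMPLE = """<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">METAe—Automated Encoding of Digitized Texts</title>
<author><name sortKey="Stehno, Birgit">Birgit Stehno</name></author>
<author><name sortKey="Egger, Alexander">Alexander Egger</name></author>
<author><name sortKey="Retti, Gregor">Gregor Retti</name></author>
</titleStmt>
<publicationStmt><idno type="doi">10.1093/llc/18.1.77</idno>
<idno type="wicri:Area/Main/Exploration">000244</idno>
</publicationStmt></fileDesc></teiHeader></TEI></record>"""


def parse_record(xml_text):
    """Extract a few bibliographic fields from a Wicri-style record."""
    # Bind the undeclared wicri: prefix on the root element so parsing succeeds.
    patched = xml_text.replace(
        "<record>", f'<record xmlns:wicri="{WICRI_NS}">', 1)
    root = ET.fromstring(patched)
    stmt = root.find(".//titleStmt")
    pub = root.find(".//publicationStmt")
    return {
        "title": stmt.findtext("title"),
        "authors": [name.text for name in stmt.findall("author/name")],
        "doi": pub.findtext("idno[@type='doi']"),
        "exploration_key": pub.findtext(
            "idno[@type='wicri:Area/Main/Exploration']"),
    }


if __name__ == "__main__":
    for key, value in parse_record(SAMPLE).items():
        print(f"{key}: {value}")
```

The `exploration_key` value is the same key (`000244`) that the `HfdSelect` commands below pass with `-nk`.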
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000244 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000244 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Ticri |area= TeiVM2 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:C82C92A176F34CD3AE19FA346A45C9E863DBF21C |texte= METAe—Automated Encoding of Digitized Texts }}
This area was generated with Dilib version V0.6.31.